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Abstract. Let A C {1, . . . , A} be a set of prime numbers containing no non-trivial 
arithmetic progressions. Suppose that A has relative density a = \A\/ir(N), where 
ir(N) denotes the number of primes in the set {1, . . . , A}. By modifying Helfgott and 
De Roton's work [7], we improve their bound and show that for every e > 

A 



a <C C 



(log log A) 1 " 6 ' 



1. Introduction 

In 1936, Erdos and Turan [3j conjectured that if a set A C N = {1, 2, 3 ... } contains 
no k term arithmetic progressions, then it cannot be "too large." We say that AcN 
has positive (upper) density if for some e > 



lim sup — 1a(x) > 



x— >oo X 

n<x 



and throughout this section we will exclude those trivial arithmetic progressions with 
difference 0. In 1953, Roth [8j proved that if a set A C N = {1, 2, 3 ... } contains no 
non-trivial arithmetic progressions, then A has density 0. Quantitatively, he showed 
that 

N 

L4n{l,2,...,AT} < 

log log N 

Roth's Theorem has been improved significantly over the last 60 years by Heath-Brown, 
Szemeredi, Bourgain, j6j [10J [lj |2] and most recently Sanders [9j who obtained 

N (log log N) 5 



L4n{l,2,...,iV}| < 



logN 



Moving to the set of prime numbers, which we will denote V, we define the relative 
density of a set A C V up to to be 



;i.l) a(N) 



\An{l,2,...,N}\ 
\Vn{l,2,...N}\ ' 



In 1939, Van Der Corput [TT] showed that V contains infinitely many non-trivial three 
term arithmetic progressions. 64 years later, in 2003, Green [4] proved an analogue 
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of Roth's Theorem inside the primes, showing that if A C V contains no non-trivial 
arithmetic progressions, then 

i 

(aj\ ^ / log log log log log \ 2 

a(N) < — : : — - 

\ log log log log N J 

Helfgott and De Roton [7] improved this density bound, removing two log's from the 
denominator, obtaining 

(1.2) a(N) « lQglQglQgi y . 

(log log N) 3 

Their result implicitly uses the best quantitative bound on Roth's Theorem in the 
integers, and when the proof is run through again with Sander's bound, the density 
recovered is 

(1.3) a(N) « ""g 108 ' ^ 1 . 

(log log N) 2 

Our main result is the following: 

Theorem 1. Suppose that A C V fl [1, N] contains no non-trivial arithmetic progres- 
sions. Then for any e > 0, the relative density a of A inside Vn[l, N] is bounded above 
by 

1 

(log log N) 1 -* 

The notation <C<e means that the constant is allowed to depend on e. 

Our proof parallels that of Helfgott and De Roton, but a factor of two is gained in 
the exponent by using the L k norm rather than the L 2 norm of the convolution of the 
indicator function of the set of primes and the indicator function a set E. Using this 
higher norm introduces several combinatorial difficulties which are dealt with in section 
[2] and in the proof of proposition [2j This L k norm bound tells us that we cannot have 
large outliers in a more precise way, and allows us to choose a larger subset on which the 
convolution is uniformly bounded from below. As in Helfgott and De Roton, the bound 
on Roth's Theorem yields a lower bound on the the size of the three term progression 
operator applied to this uniform set. If the set E is chosen correctly, the three term 
progression operator of the convolution cannot be too far from that of the indicator 
function of the primes, which gives the desired density bound for the primes. 

1.1. Preliminaries and Notation. Given a function / : Z/7VZ — > C, where is a 
prime, we define the Fourier transform to be 

f(t) = E xeZ/m f(x)e 2 ^ t / N . 

The convolution operation is given by 

(1-4) (f*g)(x)=E yeG f(y)g(x-y), 

which is suitably normalized so that 

(1-5) f7g(t) = f\t)g(t). 
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We define the norms 

||/|| Lfc(G) =E a;eG |/( a ;)| fc , 

and 

11/11^ = ^1^)1* 

When there is no ambiguity, we will omit the notation £ k (G) and L k (G), and sim- 
ply write || • \\k- We will make use of the fact that the inner product (/, g)h 2 {G) — 
^xez/Nzfi^gix), satisfies Plancherel's identity 

(1-6) ^xexez/Nzf(x)g(x) = /(*)£(*)> 



from which we obtain ||/||l 2 (g) = ||/|^ 2 (G)- Given functions f,g,h:G—^C,we let 
A(f,g, h) denote the three term arithmetic progression operator defined by 

A (/, g, h) = E x4eG f(x)g(x + d)h(x + 2d). 
2. Sieving the Primes 



N 
log A" 



Let Ac [1, AT] be a subset of the primes with relative density a so that \A\ ~ a 

and suppose that a > -3^f- Then in what is known as the U W trick" we may obtain a 

density increase of log i og N by sieving out the smallest primes. Let W = Yl p < z P ^ e the 

product of the primes less than z, and choose z — ~ log A" so that W ~ is a small 
power of N. Splitting into the different arithmetic progressions modulo W, there will 
be exactly 4>(W) nontrivial residue classes. By the pigeon hole principle there exists an 
arithmetic progression AP{b) = {b + nW : 1 < n < § } with 

(2.1) |AP(6)nA|> a A ' 



log N(f)(W) 

where the z on the right hand side appears because AP(b) starts with n — 1 rather 



than n — 0, and we are not counting the first z primes. Let P be the least prime larger 
f , so that P < f , 



then so that P < ^, and let A C [1, P] be the set 



A = < n = : m € AP(b) H A 

[ W 

noting that an arithmetic progression in A Q can be lifted to a progression in A. Since 

W 



and P we see that 3> -Plog-z, and so by equation 12.11 we have that 

14.1 » »fe 

log A/ 
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Define a = — li n to be the normalized indicator function for the set An. This function 

log z ^0 u 

satisfies 

||a||i = E n&p a{n) = a, 

ail< ^ 1 N 

||a||2 = E nGZp a(n) 2 = E neZp a(n) = 

In section [3] we will examine the key quantity A(a,a,a) in detail. To do this, we will 
first need a bound on the L k norm of the convolution of a, which is discussed in the 
following section. 

2.1. Bounding the L k norm of the convolution. Our goal is to provide bounds on 
the L 2k norms of a * a where a is the indicator function of a set S. 

Proposition 2. Let k > 1 be an integer, and suppose that a = is the normalized 
indicator function of a setH C [— j, j] c ^p- Then we have that 

'logiVY - ^ 



a * ^ 2fc <fc 1 



2fc , 
— 2fc 



log 2; 

We will make use of a corollary of Theorem 5.7 from Halberstam and Ri chert [5]: 

Lemma 3. Let g G N, and suppose we are given < M < 3x and distinct a\, . . . ,a g 
each co-prime to W with magnitude less than 3x. Let p(p) be the number of solutions 
to 

(Wn + ax) (Wn + a 2 ) • • • (Wn + a g ) = mod p, 
and let A = \{n < x : Wn + prime for i = 1, . . . , g} \ . Then 

where the constant depends only on g. 

With this lemma, we prove proposition [2j 

Proof. By the definition of the L 2fc norm and the convolution, we have that 

\\a * a\\l k k = E x \ (a * a) {x)\ 2k = E x \E y a(y)a(x - y)\ 2k . 

Expanding the sum, the quantity \\a * is bounded above by 

^yi,-,y2k\ a (yi)\ ' ' " \o-(y 2 k)\E x a(x - y\) ■ ■ ■ a(x - y 2k )- 

Since a is supported on [0, -j] , and a is supported on [ — j, j] , there can be no wrap 
around inside Zp, and we have the upper bound 

1 /logiV\ 2/c 

(2.2) E x a(x - yi) • ■ ■ a(x - y 2k ) < -= \A (yi, y 2 , ■ ■ ■ , y 2 k)\ 

P V. log z J 

where 

A(y!,y 2 ,..., y 2k ) = {n < P : b + (n - y x ) W, b + (n - y 2 fc) W all prime} . 
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The size of this set of primes is bounded above by the number of n < P such that 
b + (n — yi) W is prime, and so 

(2.3) \A ( Vl , y 2 , . . . , y 2fc ) | < ■ , , DT . A « P / 



W log(PW) Viog^, 
However we will need our sieving lemma in the case where each of the y^ is distinct. 
Let 

(yx, ... , ?/2fc) : 2/i < — , with / or less distinct coordinates > , 



r p 

= S (2/1 ; • • • j U2k) '■ yi < "2") with exactly / distinct coordinates ^ 

so that we may bound our quantity above by a sum from r = 1 to 2/c, and over 
(2/1 j • • • j2/2fc) ^ J r or J r . We split this into two cases. When r < 2k, we will bound 
above by the sum over I r , and when r = 2fc, we will use a sum over elements in the set 
J 2k- For r < 2&, we are looking at 

^ 2fc-l 

( 2 - 4 ) pWT 1^(2/1)1 ' ■ • kC^)! — 2/1) • • -a(x — ?/ 2A; ) 

r=l (jtt,...,»2fc)6-fr * 

which by equations 12.21 and 12.31 is 

/-, A7 -\ 2fc-l t 2fc-l 



Since 



r=l (yi,...,v 2 fc)eir 



^ |0"(3/l)| • • • k(2/ajfe)| = I V |L-r ( E 2/i,-,2/r-°" (yi) ' ' ' ° (Vr)) 



p2k 



1 



isi 2fc - 



2fe-l 



we have that the quantity in equation 12.41 is 

1 / log 

<<fc ]E| 

For J 2 fc ; we use Lemma [3] to attain the upper bound 

(2.5) \A (y u v, . . . , y»)\ «* n (l - PJ ^T) (l - J) 



-9+1 



for any tuple (yi, . . . , i/2fe) € J2fc- On the right hand side of equation 12. 5[ p is defined so 
that p(p) = if p < z, p{p) = 2k if p > z and p\ {y% — yj) for all i 7^ j, and p(p) > 1 
if p > z and — yj) for some z 7^ j. Since — yj\ < P, and any integer y < P 
can have at most log 2 P prime factors greater than z, we see that there are at most 
(2k) 2 primes greater than z which could divide some difference yi — yj. This yields 
the upper bound 
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< (log z) 



v 4fc 3 ^ 

z \ g z 



and as <Cfc z , combined with the fact that f-^-r) X < e c , the above is bounded 

logz ' \x—l/ — ' 

by a constant depending on k. It follows that for any (yi, . . . , y^k) G J2/C, 

E^afx - y x ) ■ - ■ a(x - y 2 k) <fc 1- 

Since 

\ a (yi)\ ■ • • kMI < E wi,...,»J°'(2/i)l ■ • ■ W(y2k) \ = 1, 

the sum over all the 2/c-tuples in J2A; is <C 1. Combining the work done so far, we have 
proven that 



\a*a\\ 2 2 k k < fe 1 + 



1 /log.Y x " 



£ I V log z 



The stated result then follows from the fact that (n + m) 2k < nzk +7712* for n,m,k > 
1. □ 

3. Main Theorem 
Let a, iV, W, P and z be defined as in section EJ Following [7J, let 
R = Spec 5 (a) U {1} = {x G Z/PZ : \a(x)\ > 5} U {1}, 

and 

P = B(R,e) = |n G Z/PZ : Vx G P, || — 1| < ej . 

The set P is called a Poftr set with radius e and frequency set P. Set cr = j^^b to 
be be the normalized indicator function of the Bohr set P. By including the element 1 
in the set P, it follows that a will be supported on [— j, t] inside Zp when e < |. Let 
ft, = a * a be our prime indicator smoothed out by the Bohr set P. Our goal is to show 
that there is little difference between the three term arithmetic progression operator 
applied to a and ft. Notice that 

IHI 1 = 7 m ^eZplg = 1, 
/i(P) 

and 

\\h\\i = Ea-ez Ey £Z a(x - y) 

MP) 

= ||cr||i||a||i = a. 

Let 

A = I A (a, a, a) - A (ft, ft, ft) | 
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where 

A(/, g, h) = E Xid f (x) g(x + d)h(x + 2d) 
is the three term arithmetic progression operator. In Helfgott and De Roton's paper 
[7J, equation (2.6) on page 7 states that 

Lemma 4. For the above definition of A, e, 5 we have 

A < e + 5. 

Applying proposition [2] with a = a, we find that 

\\a * a\\ 2 <^ 1, 

and so ||a|| 4 <C 1 since ||a*a||| = ll^lll by II. 61 and II. 51 This yields the bound \R\ <C 5~ 4 
on the size of the dimension of the Bohr set B since 

|i*|<J 4 <53|a(t)| 4 «l. 
t 

A well known pigeon hole argument tells us that \B (R, e) | > A/e'^', and so 

log \B\ > \ogN - |J?||loge| > logA^ - 5' 4 \ loge|. 

We note that an equation nearly identical to the above appears on page 9 of [7]. We 
chose to deduce it again since the bound on ||a||4 was obtained in a different way, 
without using Green and Tao's results on the Selberg sieve. From now on, we will 
assume that e, S satisfy 

(3.1) <T 4 |loge| < -\ogN. 



2 



In this case, we have that 



\B\ > > fc 



log N 
log z 



2k 



and so by Proposition |2l 

(3.2) \\hhk= \\a*a\\ 2k < fe 1. 

Using this bound on the L k norm of h = a * a, we prove that A(h, h, h) must be large. 
Proposition 5. Given a positive integer k > 2, we have that 

1 




A (h, h, h) > fc exp I -c{k)a q2h [log 

where q 2 k = (l — kt) 1 , and c(k) is a constant dependent on k. 

To prove this proposition, we will make use of the following lemma which allows us 
to find a large subset where the function h is bounded below uniformly. 

Lemma 6. Let q,p > 1 be such that - + - = 1, and let f : Zp — > IR + be a function with 
\\f\\i = a, and I/Hp < C for some C . Then there exists a subset L C TLp such for all 
n G L f{n) > ~, and 



(-Y 

\2CJ 



<//(L). 
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Proof. Define L = \n : f{n) > so that L is the largest possible set satisfying the 
second condition. Then, if lz(n) is the indicator function for L we have that 

a < E neZp f{n) <7j + E neZp f(n)l L (n). 

Applying Holder's inequality yields 

E n&p f(n)l L (n) < \\l L {n)\\ q \\f(n)\\ p <C{»{L))< 
by our assumption that ||/|| p < C. The lemma then follows from the resulting inequality 

f<c(M£)) 1/9 . □ 

Sanders improvement to Roth's theorem [9] states that if H C [1, A] with density £, 
then 

(3.3) A (l s , l s , 1 5 ) » exp ^-cC 1 (log \ . 

Using this result along with equation 13.21 and lemma we are ready to finish the proof 
of proposition |5j 

Proof. By equation 13. 2[ it follows that \\h\\zk 1- Applying lemma [6] to the function 
h, we obtain a subset L C Zp with 

fi (L) > fc 

and h{n) > ^ for all n G L where ^- = 1 — This implies that 

a 3 

(3.4) A(h,h,h)>—A(1 L ,1 L ,1 L ), 
Applying the bound in equation 13.31 to our set L, we have that 



A (1 L , 1 L , l L ) > exp -c«- 92fc log 



l^ 5 



a: 

for some constant c depending on fc, since the density of L is 3>*. a q2k . By equation 13. 4[ 
we have that 

A (h, h, h) > exp (^-c(k)a~ q2k (log - 

for a different constant c(k) dependent on k, as desired. □ 
Lemma |4] tells us that 

\A(h, h, h) - A(a, a, a) | < e + 5, 

and Proposition [5] implies that A(h, h, h) must be very large. Recalling equation 13. 1[ 
the requirement that e, 5 satisfy 5~ 4 |loge| < |logA, we are ready to put everything 
together and give a precise lower bound for the size of A(a, a, a). We now prove Theorem 

m 
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Proof. Suppose that A C V contains no nontrivial arithmetic progressions. Then Aq 
contains only the trivial 3 term arithmetic progressions, and we have that the three 
term arithmetic progression operator is bounded above by 

(3.5) A ( a , a , a )«I('^ X 

P \ log z 

By equation 13.51 lemma H] and proposition |5] , we must have 

logiV\ 3 c (-of, 1 X5% 

+ e + o 3>fc exp — ca q2k log — 



P V log z I \ \ a 



In the above, the term ^^rj) wm be negligible compared to the right hand side 



r 5N 



provided that a > — - — r . Letting 

(log TV) 3 

e = S = K exp ( —ca~ q2k ( log — 

we will have a contradiction if K is taken small enough. These values of e, 5 satisfy the 
necessary constraint |loge| 5~ A < | log iV as long as 

1\ 5 \ 1 



exp ( c 2 a q2k [log -J J <^ k -\ogN 

for some new constant Ci- The above holds when 

(log log log AT) q 2k (log log log AT) v 2k - 
a >fc 1 — > 



(log log N) «™ (log log A^) 1 

and by taking k sufficiently large, we obtain the main result for any e > 0. □ 
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